Translational Symmetry in Subsequence Time-Series Clustering
نویسنده
چکیده
We treat the problem of subsequence time-series clustering (STSC) from a group-theoretical perspective. First, we show that the sliding window technique introduces a mathematical artifact to the problem, which we call the pseudo-translational symmetry. Second, we show that the resulting cluster centers are necessarily governed by irreducible representations of the translational group. As a result, the cluster centers necessarily forms sinusoids, almost irrespective of the input time-series data. To the best of the author’s knowledge, this is the first work which demonstrates the interesting connection between STSC and group theory.
منابع مشابه
Why Does Subsequence Time-Series Clustering Produce Sine Waves?
Data mining and machine leaning communities were surprised when Keogh et al. (2003) pointed out that the k-means cluster centers in subsequence time-series clustering become sinusoidal pseudopatterns for almost all kinds of input time-series data. Understanding this mechanism is an important open problem in data mining. Our new theoretical approach (based on spectral clustering and translationa...
متن کاملA Review of Subsequence Time Series Clustering
Clustering of subsequence time series remains an open issue in time series clustering. Subsequence time series clustering is used in different fields, such as e-commerce, outlier detection, speech recognition, biological systems, DNA recognition, and text mining. One of the useful fields in the domain of subsequence time series clustering is pattern recognition. To improve this field, a sequenc...
متن کاملTheorectical Analysis of Subsequence Time-Series Clustering from a Frequency-Analysis Viewpoint
Although Subsequence Time Series (STS) clustering is one of the most popular pattern discovery techniques from timeseries data, a mathematical methodology for analyzing STS clustering (or pattern discovery from time-series data) has attracted little attention. In the situation, it has had a surprising report [10] that cluster centers obtained using STS clustering closely resemble ”sine waves” w...
متن کاملUseful Clustering Outcomes from Meaningful Time Series Clustering
Clustering time series data using the popular subsequence (STS) technique has been widely used in the data mining and wider communities. Recently the conclusion was made that it is meaningless, based on the findings that it produces (a) clustering outcomes for distinct time series that are not distinguishable from one another, and (b) cluster centroids that are smoothed. More recent work has si...
متن کاملSelective Subsequence Time Series clustering
0950-7051/$ see front matter 2012 Elsevier B.V. A http://dx.doi.org/10.1016/j.knosys.2012.04.022 ⇑ Corresponding author. Tel.: +66 8 9499 9400; fax E-mail addresses: [email protected] (S. Ro chula.ac.th (V. Niennattrakul), [email protected] Subsequence Time Series (STS) Clustering is a time series mining task used to discover clusters of interesting subsequences in time series data...
متن کامل